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Abstract 

We study the problem of computing approximate Nash equihbria (e-Nash equi- 
hbria) in normal form games, where the number of players is a small constant. 
We consider the approach of looking for solutions with constant support size. It 
is known from recent work that in the 2-player Nash equilibrium can be 

easily found, but in general one cannot achieve a smaller value of e than i. In 
this paper we extend those results to the fc-player case, and find that e = 1 — ^ 
is feasible, but cannot be improved upon. We show how stronger results for the 
2-player case may be used in order to slightly improve upon the e = 1 — obtained 
in the fc-player case. 

1 Introduction 

A game in normal form has k players, and for each player p a set of pure strategies. 
In this paper we assume that all sets are of the same size n. The set S of pure 
strategy profiles is the cartesian product of the sets S^. For each player p and each 
s G 5, the game has an associated value uf , being the utility or payoff to player p if 
all players choose s. Note that the number of quantities needed to specify the game 
is k ■ n'^, so we must take A: to be a constant, for the game's description to be of size 
polynomial in n. 

A mixed strategy for player p is a probability distribution over S^. Suppose that 
each player p chooses distribution over S^. Let us define a player's regret to be the 
highest payoff he could obtain by choosing a best response to the other players' mixed 
strategies, minus his actual expected payoff. Then, a Nash equilibrium is a set of D^'s 
for which all players' regrets are zero. We say that the distributions form an e-Nash 
equilibrium provided that all players' regrets are at most e. 



*This work was supported by DFG grant VO 889/2, EPSRC Grant GR/T07343/02, and by the EU 
within the 6th Framework Programme under contract 001907 (DELIS). 
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1.1 Recent work 

Due to the apparent difficulty of computing Nasli equilibria exactly [U El Ej, recent 
work has addressed the question of polynomial-time computability of approximate Nash 
equilibria, in particular the e-Nash equilibria defined above. The effort has mostly 
addressed 2-player games, and the general question is, for which values of e can e-Nash 
equilibria be found in polynomial time? The payoffs u§ in the games are restricted 
to lie in the range [0, 1], since otherwise the payoffs (and associated values of e) could 
be rescaled arbitrarily. Recent papers include |T3j, which show how to efficiently 
compute 2-player e-Nash equilibria for e = 0.36392 and 0.3393 respectively. We do not 
know of similar work so far that addresses the A;-player case considered here. 

The support of a probability distribution is the number of elements of its domain that 
have non-zero probability. Solutions of games having small support (that is, players' 
mixed strategies do not give positive probability to many of the pure strategies) are 
attractive for two reasons. From the perspective of modelling a plausible outcome, we 
expect a participant in a game to prefer a simple behaviour. Also, if constant-support 
solutions exist, then if the number of players is constant, we can find them in polynomial 
time by brute force. For example, this approach is used by Barany et al. [H] to find Nash 
equilibria in random games. Our results show that additional players makes it genuinely 
more intractable to find a satisfactory solution; if we restrict the support sizes to any 
constant then the worst-case regret of some player increases. 

It is known that for 2 players, if we restrict ourselves to solutions with constant 
support, then there is a lower bound of ^ on the best e that can be achieved [lOj, 
and this lower bound is achieved by a very simple algorithm of Daskalakis, Mehta and 
Papadimitriou [7j. 

Regarding the issue of how large the support size needs to be in order to allow an 
e-Nash equilibrium, it is shown by Althofer [TJ and Lipton et al. [12] that log(n)/e^ 
support size is sufficient, and [12] point out that the general method works for any 
constant number k of players. It follows that the brute-force "support enumeration" 
algorithm takes time 0{'n}°^"') for any e > 0. 

Very recently, Hemon et al. [TT] studied independently from our work the problem 
of computing approximate Nash equilibria in games with more than two players. The 
results they obtain are very similar to the results presented in this paper. In particular, 
they also show that for games with k players a (1 — ^)-Nash equilibrium with constant 
support size can be computed efficiently and that this is the best possible which can be 
achieved with constant support strategies. Additionally, they also show that support size 
0(log(n)/e^) is sufficient for computing e-approximate Nash equilibria in the additive 
and multiplicative sense. 

1.2 Our Results 

We give a simple algorithm, essentially an extension of [7] from the 2-player case to the 
/c-player case, that finds (1 — ^)-Nash equilibria in which each player's mixed strategy 
has support size at most 2. Notice that this result becomes quite weak as k increases, 
given that any set of mixed strategies constitutes a 1-Nash equilibrium (recall that we 
assume all payoffs lie in the range [0, 1]). However, we also show that one can do no 
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better for constant support strategies. The argument is a kind of generalisation of the 
one of [TU] that gives the result for the 2-player case. 

For unrestricted strategies, we give a method that uses any algorithm for the 2- 
player case as a component, and provides slightly better values of e than the above. 
Finally, we show how the bounded differences inequality can be used to give a simplified 
proof of a result of [12], that for a constant number of players, O(logn) support size is 
sufficient for finding approximate equilibria. This allows us to deduce that if the number 
of players is less than y^, there is a subexponential algorithm for finding approximate 
Nash equilibria. 

2 Details of Results 

2.1 Solutions with Constant Support 

In this section we generalise the results of [71 10] from e = ^ in the 2-player case, to 
e = 1 — ^ in the /c-player case. 

Definition 1. Define a winner-takes-all game to be one in which, for any combination 
of pure strategies, one player obtains a payoff of 1, and the other players obtain payoffs 
ofO. 

In the 2-player case, winner-takes-all games are win-lose games where the payoffs 
sum to 1. Our generalisation of the lower bound of [10] follows their approach in that 
we generate a random winner-takes-all game and show that for support sizes limited by 
some constant t, for any e < 1 — ^, if the game is large enough, there will be positive 
probability that any solution with supports at most t is not e-Nash. 

Definition 2. Define the total support of a mixed- strategy profile to be the sum over 
all players p, of the number of strategies of p that p assigns non-zero probability. 

Theorem 3. Let t and k be any positive integers, and let e = 1 — p There exist games 
having k players such that in any e-Nash equilibrium, there must be at least one player 
whose mixed strategy has support greater than t. (In particular, there exist games with n 
strategies for each player such that in any e-NE at least one player must have a strategy 
with support Vl{ ''"^log n) .) 

Proof. We construct a /c-player winner-takes-all game Q uniformly at random as follows. 
Let each player have n pure strategies (a suitable value of n will be identified later) . For 
each combination of pure strategies, choose one of the k players uniformly at random 
and let that player have an associated payoff of 1, while the remaining players have 
payoffs of 0. 

We prove that for large enough n, with positive probability, the resulting game has 
the desired property. 

Consider strategy profiles with total support t. There are less than {^^^ possible 
support sets. 

We choose n large enough such that for any individual support set of total size t, 
the probability (with respect to random generation of Q) that all players have regret 
less than e is less than l/f^""), so that we can apply a union bound. 
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Given any mixed-strategy profile, we know that there exists at least one player who 
has an expected payoff of at most 1/fc, since the payoffs always sum to 1. Fix a support 
set S of total size t. We show that all players (including in particular the one with 
lowest expected payoff) have (with positive probability) a pure strategy that, if they 
unilaterally defected to it, would give them payoff 1 (for any mixed profile using S). 

For each player p, there are n pure strategies to choose from. In order for some given 
strategy (available to player p) to have expected payoff of 1 with respect to the other 
players' strategies, a sufficient condition is that the payoff entries corresponding to 
and the < t — 1 strategies in use by the other players, should let player p win. There 
are fewer than t^~^ pure-strategy combinations that the other players can select with 
non-zero probability. 

The probability that wins against a fixed pure-strategy combination is 1/k, so 
the probability that wins against all pure-strategy combinations of the other players 
is k~^^ \ Hence, the probability that player p has no strategy that guarantees him a 
payoff of 1 is 

[i-k-^^'-'^y . 

The probability that there exists a player p without a winning strategy is at most 



k . [i - k~i^'~'^y . 



The probability that there exists a support set for which there exists a player p 
without a winning strategy is at most 

We can upper bound this probability by 



Let us set t = y/aAog^Jn) for some constant a > 0. Then the term simplifies to 

A: - (A;n)'"\/"i°SfeW . (i-n"")" . (1) 

For a = 1/2, we can estimate (1 — n~°')"' by and hence ^ tends to zero when n 

tends to infinity. 



This ensures that if the size of the support set is bounded from above by ''"{/log;,(n)/2, 
then, for large enough n, with positive probability each player has a pure strategy with 
payoff 1, regardless of the choice of the support set. 

Hence, we require a total support size of $!( ''"^/log n) to ensure an approximation 
performance of better than 1 — 1/k, which for two players agrees with the upper bound 
of il2]. □ 



Corollary 4. // we restrict strategies to have constant support, there is a lower bound 
of 1 — ^ on the approximation quality that we can guarantee. 
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Note that the above is a generahsation of a lower bound of [U [TO] from the 2-player 
case to the fc-player case. We show that this lower bound is optimal by giving an 
algorithm (which is an extension of an algorithm of [7j) that finds a 1 — ^-approximate 
Nash equilibrium where each player has support size at most 2. 

Theorem 5. Let Q be a k-player game with n strategies per player. Let e = 1 — p There 
is a mixed strategy profile in which each player has support at most 2 which constitutes 
an e-Nash equilibrium. 

Proof. The following algorithm achieves the stated objective. 

1. Number the players 1, . . . ,k. For i = 1, . . . ,k — 1 in increasing order, player i 
allocates a probability of 1 — to an arbitrary pure strategy Sj. 

2. Player k allocates probability 1 to a pure best response bk to the strategy combi- 
nation (si, . . . , Sfc-i) of players 1, . . . , fc — 1. 

3. For i = k — l,...,lm decreasing order, player i allocates his remaining probability 

to a pure best response bi to the mixed strategy formed so far, i.e., to 
the strategy (si, . . . , rj+i, . . . , r^), where rj denotes the mixed strategy (1 — 

k+l-j) ■ + (fc+l-j) ■ 

In order to prove the theorem, we compute the regret of a player i £ {1, . . . ,k}. To 
avoid case distinctions, we denote by an arbitrary pure strategy of player k. If player 
i plays strategy Si, which happens with probability 1 — -j^^^rzi^ his regret can only be 
bounded by 1. If player i plays strategy bi, then his regret is if all players j < i play 
strategy Sj and his regret can be as bad as 1 otherwise. Altogether, this implies that 
the regret of player i is bounded by 

Pr [i plays Si] + Pr [i plays bi] • Pr [3j < i : j plays bj] 



□ 




2.2 Solutions with non-constant support 

The general issue of interest is the question of what approximation guarantee can we 
obtain for a polynomial-time algorithm for /c-player approximate Nash equilibrium. To 
what extent can we improve on the above (weak) result when we allow unrestricted 
mixed strategies? We do not have a very substantial improvement, but the following 
shows how the trick of [7] can be combined with better 2-player algorithms to get an 
improvement over the (1 — ^)-result that the above would yield for k players. 

Theorem 6. If 2-player e- approximate Nash equilibria can be found in polynomial time, 
then k-player ^^_^j_|^_2)^ -Nash equilibria can be found in polynomial time. 

As a corollary, the approximation guarantee of 0.3393 associated with the algorithm 
of |13j gives a 3-player algorithm with an approximation guarantee of 0.6022. 
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Proof. We prove the theorem by induction on the number of players. Let dk = • 
For k = 2, which is the induction basis, 62 = e. Now assume that there is an algorithm 
Ak-i with approximation guarantee 5k-i for {k — l)-player games. The following algo- 
rithm for fc-player games achieves the stated approximation guarantee 5k- 

1. Player 1 allocates a probability of 1/(2 — Sk-i) to some arbitrary pure strategy s. 

2. Players 2, . . . ,k apply Ak^i to the {k — l)-player game that results from letting 
player 1 play s. 

3. Player 1 allocates his remaining probability to a pure best response b to the strate- 
gies of players 2, . . . ,k chosen in step 2. 

A player i G {2, ... A;} has regret at most 5k-i if player 1 plays strategy s and his 
regret can be as bad as 1 otherwise. Hence, the regret of a player i G {2, . . . k} can be 
bounded by 

Sk-i + 1 



— Sk-iJ V 2 — 2 - 6k-i 

Player 1 has no regret when playing b and his regret can be as bad as 1 when playing s. 
This implies that also the regret of player 1 can be bounded by 1/(2 — A simple 

calculation shows 

1 _ 1 _ {k-2)-{k- 3)e 

as desired. □ 

Finally, we use the bounded differences inequality to give a simplified proof of a 
result of lia. 



Theorem 7. Let Q be a k-player game in which each player has n pure strategies. 
Let e > 0. Then there exists a mixed strategy profile where each player has support 
A;^ log(A:n)/2e^ that constitutes an e-Nash equilibrium. In particular, the players' distri- 
butions are empirical distributions over multisets of size k^ \og{kn)/2e^. 

Proof. The bounded differences inequality (see e.g. [9j p. 8) is the following. Let A be 
a set. Let g : A'^ — > R be a function with the bounded difference property: for all 
i G {1, . . . , n} and for all xi, . . . ,xn,x'^ £ A 

\g{xi, . . . ,xn) - g{xi, . . . ,Xi_i,x'i,Xi+i, . . . ,xn)\ < Ci. 

Suppose Xi, . . . , are independent random variables over A. The bounded difference 
inequality states that for all t > 0, 



XN)-Bg{Xi,...,XN)>t] 
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and 



Pr [E5(Xi, ...,Xn)- 5(^1, • ■■,XN)>t] 

( -2t2 ^ 

< exp 



We apply the above as follows. Let A = [n]^, the set of all pure profiles of fc-person 
games where each player has n strategies. The Xj are samples from some fixed Nash 
equilibrium. Let gi^j be the payoff to player i for using strategy j subject to all players 
using the mixture of N strategies obtained by taking, for each player p, the p-th entries 
of each Xm {"m G [-/V]), and using the uniform distribution on that multiset. 
Note that for the gij functions, Cj < k/N . 

We want the right-hand sides to be less than l/2kn, so that by a union bound, for 
each i and j, we have the payoff for player i using strategy j being close to expected. 
Thus, we want N such that 

eM-^e'/{N{k'/N'))) < ^. 

Solving for we get 

k'^log(2kn) _ 

The theorem indicates that support enumeration is a subcxponential algorithm pro- 
vided that k = o(n^/^) (a brute-force search has complexity 0{n^^) which is subcxpo- 
nential for k = 0(n^/^~'').) Let us remark, that for k = o(n^/^) support enumeration 
is not only subexponential in the input size but also in the number n of different 
strategies of the players. Hence, in this case, the running time is subexponential even 
for more succinctly represented games. 



3 Conclusions 

It seems that only very weak approximation performance is possible with constant- 
support strategies. Work on the special case of 2-player games has mostly addressed 
this problem by constructing linear programs whose solutions have useful properties (and 
are typically non-constant support strategies). For more than 2 players this approach no 
longer works; the corresponding constraints are no longer linear. The challenge seems 
to be to find alternative ways of describing mixed strategies (of more than constant 
support) that have desirable properties. 
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